Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise
نویسندگان
چکیده
A complete set of frequent itemsets can get undesirably large due to redundancy. Several representations have been proposed to eliminate the redundancy. Existing generator based representations rely on a negative border to make the representation lossless. However, negative borders of generators are often very large. The number of itemsets on a negative border sometimes even exceeds the total number of frequent itemsets. In this paper, we propose to use a positive border together with frequent generators to form a lossless representation. A set of frequent generators plus its positive border is always no larger than the corresponding complete set of frequent itemsets, thus it is a true concise representation. The generalized form of this representation is also proposed. We develop an efficient algorithm, called GrGrowth, to mine generators and positive borders as well as their generalizations.
منابع مشابه
A New Concise and Lossless Representation of Frequent Itemsets Using Generators and A Positive Border
A complete set of frequent itemsets can get undesirably large due to redundancy when the minimum support threshold is low or when the database is dense. Several concise representations have been proposed to eliminate the redundancy. Existing generator based representations rely on a negative border to make the representation lossless. However, negative borders of generators are often very large...
متن کاملEmerging Cubes: Borders, size estimations and lossless reductions
Discovering trend reversals between two data cubes provides users with a novel and interesting knowledge when the real world context fluctuates: What is new? Which trends appear or emerge? Which tendencies are immersing or disappear? With the concept of Emerging Cube, we capture such trend reversals by enforcing an emergence constraint. We resume the classical borders for the Emerging Cube and ...
متن کاملNegative Generator Border for Effective Pattern Maintenance
In this paper, we study the maintenance of frequent patterns in the context of the generator representation. The generator representation is a concise and lossless representation of frequent patterns. We effectively maintain the generator representation by systematically expanding its Negative Generator Border. According to our literature review, no prior work has studied the maintenance of the...
متن کاملExploring the Disjunctive Search Space towards Discovering New Exact Concise Representations for Frequent Patterns
Extracting concise representations seems to be a milestone towards the emerging knowledge extraction field. In fact, it is a quite survival reflex towards providing a manageably-sized and reliable knowledge. Thus, we bashfully witness the emergence of a trend towards extracting concise representations, e.g., closed patterns, non-derivable patterns and essential patterns. The essential pattern-b...
متن کاملNon-Derivable Item Set and Non-Derivable Literal Set Representations of Patterns Admitting Negation
The discovery of frequent patterns has attracted a lot of attention of the data mining community. While an extensive research has been carried out for discovering positive patterns, little has been offered for discovering patterns with negation. The main hindrance to the progress of such research is huge amount of frequent patterns with negation, which exceeds the number of frequent positive pa...
متن کامل